Synthesis of Sound Textures with Tonal Components Using Summary Statistics and All-pole Residual Modeling
Abstract
The synthesis of sound textures, such as flowing water, crackling fire, or an applauding crowd, is impeded by the lack of a quantitative definition. McDermott and Simoncelli proposed a perceptual source-filter model that uses summary statistics to create compelling synthesis results for non-tonal sound textures. However, the proposed method does not work well with tonal components. By comparing the residuals of tonal and non-tonal sound textures, we show the importance of residual modeling. We then propose a method that uses autoregressive modeling to reduce the amount of data needed for resynthesis, and we delineate a modified method for analyzing and synthesizing both tonal and non-tonal sound textures. Through user evaluation, we find that modeling the residuals increases the realism of tonal sound textures. The results suggest that the spectral content of the residuals plays an important role in sound texture synthesis, filling the gap between filtered noise and sound textures as defined by McDermott and Simoncelli. Our proposed method opens the possibility of applying sound texture analysis to musical sounds such as rapidly bowed violins.
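The abstract names autoregressive (all-pole) modeling of the residual but gives no implementation details. The following is a minimal sketch of the general idea, not the authors' procedure: it assumes the autocorrelation method of linear prediction, and the function names `fit_all_pole` and `all_pole_filter`, the model order, and the test signal are all hypothetical choices made for illustration. A signal is summarized by a few filter coefficients plus a gain, and a new signal with a similar spectral envelope is produced by filtering white noise.

```python
import numpy as np

def fit_all_pole(x, order):
    """Autocorrelation-method linear prediction (illustrative helper).

    Returns (a, gain) for the model A(z) = 1 + a[1] z^-1 + ... + a[p] z^-p,
    where gain is the standard deviation of the prediction error.
    """
    x = np.asarray(x, dtype=float)
    n = len(x)
    # Biased autocorrelation estimates for lags 0..order.
    r = np.array([np.dot(x[:n - k], x[k:]) for k in range(order + 1)]) / n
    # Toeplitz normal equations: R a_tail = -r[1:].
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a_tail = np.linalg.solve(R, -r[1:])
    a = np.concatenate(([1.0], a_tail))
    # Prediction-error variance: r[0] + sum_k a[k] r[k].
    gain = np.sqrt(max(r[0] + np.dot(a_tail, r[1:]), 0.0))
    return a, gain

def all_pole_filter(excitation, a, gain):
    """Run y[n] = gain * e[n] - sum_k a[k] * y[n - k] (direct form)."""
    order = len(a) - 1
    y = np.zeros(len(excitation))
    for i in range(len(excitation)):
        acc = gain * excitation[i]
        for k in range(1, min(order, i) + 1):
            acc -= a[k] * y[i - k]
        y[i] = acc
    return y

# Assumed stand-in for a "residual": noise shaped by one resonance
# (pole radius 0.95, pole angle 0.3 * pi radians).
rng = np.random.default_rng(0)
a_true = np.array([1.0, -2 * 0.95 * np.cos(0.3 * np.pi), 0.95 ** 2])
residual = all_pole_filter(rng.standard_normal(20000), a_true, 1.0)

# Analysis: 20000 samples are reduced to two coefficients and a gain.
a_hat, g_hat = fit_all_pole(residual, order=2)

# Resynthesis: fresh white noise through the estimated all-pole filter.
resynth = all_pole_filter(rng.standard_normal(20000), a_hat, g_hat)
```

In this sketch the data reduction comes from storing only `a_hat` and `g_hat` instead of the residual waveform. A real texture residual would likely need a higher model order and possibly per-band or per-frame fitting, which the abstract does not specify.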
Similar resources
Comparison of Linear Prediction Models for Audio Signals
While linear prediction (LP) has become immensely popular in speech modeling, it does not seem to provide a good approach for modeling audio signals. This is somewhat surprising, since a tonal signal consisting of a number of sinusoids can be perfectly predicted based on an (all-pole) LP model with a model order that is twice the number of sinusoids. We provide an explanation why this result ca...
Sound Texture Perception via Statistics of the Auditory Periphery: Evidence from Sound Synthesis
Rainstorms, insect swarms, and galloping horses produce "sound textures"--the collective result of many similar acoustic events. Sound textures are distinguished by temporal homogeneity, suggesting they could be recognized with time-averaged statistics. To test this hypothesis, we processed real-world textures with an auditory model containing filters tuned for sound frequencies and their modul...
Numerical study on the acoustic field of a centrifugal fan and the tonal noise sources
The widespread use of squirrel cage fans, especially in ventilation and home and industrial environments, has led to the formation of many research efforts to improve performance and reduce the sound produced by this type of fan. In the literature, the most important factor in generating sound in this category of fans is the confrontation between the rotor exit flow and the volute of the fan. I...
Psychoacoustic evaluation of tonal components in view of sound quality design for high-speed train interior noise
1. Introduction In high-speed trains, tonal components produced either by the motor or by corrugated rails may occur. As it is well-known from literature ([1], [2], [3]) that environmental or artificial noise containing tonal components is more annoying than noise without tonal components, the present investigation assesses the importance of reducing such tonal components. Therefore, typical so...
Cascaded Amplitude Modulations in Sound Texture Perception
Sound textures, such as crackling fire or chirping crickets, represent a broad class of sounds defined by their homogeneous temporal structure. It has been suggested that the perception of texture is mediated by time-averaged summary statistics measured from early auditory representations. In this study, we investigated the perception of sound textures that contain rhythmic structure, specifica...